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Background: Verbal autopsy (VA) is the only available approach for determining the cause of many deaths, 
where routine certification is not in place. Therefore, it is important to use standards and methods for VA that 
maximise efficiency, consistency and comparability. The World Health Organization (WHO) has led the deve- 
lopment of the 2012 WHO VA instrument as a new standard, intended both as a research tool and for routine 
registration of deaths. 

Objective: A new public-domain probabilistic model for interpreting VA data, InterVA-4, is described, which 
builds on previous versions and is aligned with the 2012 WHO VA instrument. 

Design: The new model has been designed to use the VA input indicators defined in the 2012 WHO VA 
instrument and to deliver causes of death compatible with the International Classification of Diseases version 
10 (ICD-10) categorised into 62 groups as defined in the 2012 WHO VA instrument. In addition, known 
shortcomings of previous InterVA models have been addressed in this revision, as well as integrating other 
work on maternal and perinatal deaths. 

Results: The InterVA-4 model is presented here to facilitate its widespread use and to enable further field 
evaluation to take place. Results from a demonstration dataset from Agincourt, South Africa, show continuity 
of interpretation between InterVA-3 and Inter VA-4, as well as differences reflecting specific issues addressed in 
the design and development of InterVA-4. 

Conclusions: InterVA-4 is made freely available as a new standard model for interpreting VA data into causes 
of death. It can be used for determining cause of death both in research settings and for routine registration. 
Further validation opportunities will be explored. These developments in cause of death registration are likely 
to substantially increase the global coverage of cause-specific mortality data. 
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Verbal autopsy (VA) covers the entire process 
of interviewing close caregivers - relatives, friends 
or witnesses - about the details of a death that 
occurred and then uses the interview data to arrive at a 
probable cause of death. This is not a new concept, and 
although VA might not necessarily be the best or most 
accurate approach for determining individual causes of 
death, it is nevertheless a much better option than simply 
allowing deaths to pass unrecorded. Provided VA is 
implemented with reasonable rigour and consistency, it 
leads to information on cause-specific mortality patterns 
which are otherwise unavailable. The need for more 
comprehensive death registration, and the potential con- 
tribution of VA in this, is described in a recent blog (1). 

In recent years increasing attention has been paid 
to computerised procedures for determining cause of 
death from VA data (2). VA data have commonly been 
interpreted by physicians, but this has proved to be 
a costly, slow and non-reproducible process in many 
situations, yielding VA cause of death information that 
cannot readily be compared between settings (3). Pro- 
ponents of physician interpretation argue that a more 
nuanced approach to detailed causes of death (including 
co-morbidities) can be achieved by physicians consider- 
ing a narrative account of the circumstances leading to 
death (4), while computerised processes can be compara- 
tively fast, cheap and reproducible over time and place, 
and also speed up VA interviews by obviating the need 
for transcribing lengthy narratives (5). 

Over the past decade, a series of Inter VA models have 
been developed using Bayesian probabilistic modelling 
as a means of interpreting VA data to derive causes of 
death. Starting on a very experimental basis with VA data 
from Vietnam (6), these models progressed to InterVA-3 
which has been widely used in a variety of settings across 
Africa, Asia and Latin America (7-1 5) and an associated 
model, InterVA-M, which dealt separately with deaths 
among women of reproductive age (16, 17). Additional 
developments for handling neonatal deaths have also 
been reported previously (18). 

In parallel with these technical developments for 
ascertaining cause of death, concern has grown about 
the large proportion of deaths in the world that are not 
certified by cause and which therefore contribute nothing 
to global evidence on cause of death and the implications 
for public health. It is also clear that the chances of a 
death being registered are strongly determined by geo- 
graphic location and socio-economic status (19), meaning 
that widening the scope of death registration (including 
cause determination) also represents a means of reducing 
global bias in cause of death information. This has led the 
World Health Organization (WHO), the Health Metrics 
Network and the United Nations Statistical Commission 



to seek ways forward for extending routine death registra- 
tion on a much wider scale. Hitherto VA has been largely 
used in research settings, such as the Health and Demo- 
graphic Surveillance Sites of the INDEPTH Network 

(20) . Now, particularly with the development of shorter 
and more automated methods, VA offers new opportu- 
nities for wider implementations of routine cause of 
death registration, not only in research environments. 

This paper presents the underlying principles of the 
latest model in the InterVA suite, InterVA-4, which inte- 
grates experience accumulated from previous versions, 
latest data and research findings, and revisions by an 
expert panel. The InterVA-4 model is freely available and 
can be downloaded from www.interva.net. The Inter VA-4 
User Guide is attached to this paper as Supplementary 
Material. As well as reflecting previous experience, this 
latest model has been constructed to reflect the structure 
of the 2012 WHO Verbal Autopsy instrument, developed 
in an expert consultation in Geneva in December 2011 

(21) . Inter VA-4 aims to provide a consistent and generally 
applicable means of interpreting VA data, being modelled 
on (but not restricted to) the 2012 WHO Verbal Autopsy 
instrument, and hence applicable prospectively and retro- 
spectively. The model is intended for use both within 
already-enumerated populations and as a stand-alone 
death registration tool, both in research and in civil 
registration. 

Probabilistic methods 

Bayes' theorem (22) links the probability of an event 
happening given a particular circumstance with the 
unconditional probability of the same event and the con- 
ditional probability of the circumstance given the event. 
If the event of interest is a particular cause of death, 
and the circumstance is part of the events leading to 
death, then Bayes' theorem can be applied in terms of 
circumstances and causes of death. 

Specifically, if there are a predetermined set of possible 
causes of death Q ... C m and another set of indicators 
Ii . . . I n representing various signs, symptoms and cir- 
cumstances leading to death, then Bayes' general theorem 
for any particular C, and Ij can be stated as: 



P(Cj|L,) : 



P(IjlQ) x P(C.) 

PdjIQ) x P(C,) + PdjHQ) x P(!Q) 



where P(!Q) is (1 -P(Q)). 

Over the whole set of causes of death d ... C m a set 
of probabilities for each Q can be calculated using 
a normalising assumption so that the total conditional 
probability over all causes totals unity: 



P(Q|L) : 



PdjlCj) x p(c.) 

£r=,p(c,) 
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Using an initial set of unconditional probabilities for 
causes of death d . . . C m (which can be thought of as 
P(C;|I 0 )) and a matrix of conditional probabilities P(Ij|Q) 
for indicators I ; . . . I n and causes Q . . . C m , it is possible 
to repeatedly apply the same calculation process for each 
Ii . . . I n that applies to a particular death: 

PdJQ) x P(Q|I 0 „„_,) 

(lll - n) " e^cjCj 

This process typically results in the probabilities of 
most causes reducing, while a few likely causes are char- 
acterised by their increasing probabilities as successive 
indicators are processed. 

The lnterVA-4 model 

Inter VA-4 aims to be a tool for interpreting VA data which 
can be applied simply, quickly and cheaply to generate 
cause of death data, compatible with the International 
Classification of Diseases version 10 (ICD-10). It offers 
the consistency and reproducibility that is characteristic 
of mathematical models and thus facilitates comparisons 
of VA cause of death between different places and over 
time. Furthermore, as a public-domain resource it is freely 
available and runs on any Windows®-based personal 
computer, or under the Mac OS® as a virtual Windows® 
instance. It builds on experience with previous InterVA 
models but brings a new standard of VA interpretation by 
conforming to the 2012 WHO VA instrument in terms of 
cause of death categories and input indicators. Indicators 
relating to durations in the WHO instrument are split into 
pre-defined categories for processing in InterVA-4. Tables 
of cause of death categories and input indicators from the 
2012 WHO VA instrument as handled by Inter VA-4 are 
included in the InterVA-4 User Guide (Supplementary 
Material). 

For deaths among women of reproductive age, a 
separate sub-model is incorporated which uses exactly 
the same methodology over three pregnancy status out- 
comes to assess the likelihood of pregnancy being as- 
sociated with the death (but not necessarily in a causal 
sense). This was previously implemented in InterVA-M 
and is important in cases where pregnancy status at death 
may be ambiguous in the VA data (16). 

Apart from the mathematics, the major challenge in 
building a probabilistic model covering all causes of 
death to a reasonable level of detail lies in populating 
the matrix of conditional probabilities P(Ij|Q). There is 
no overall source of data available which systematically 
quantifies probabilities of various signs, symptoms and 
circumstances leading to death in terms of their associa- 
tions with particular causes. Therefore, these conditional 
probabilities have to be estimated from a diversity of 
incomplete sources (including previous InterVA models) 
and modulated by expert opinion. In the various versions 
of InterVA that have been developed, expert panels have 



been convened to capture clinical expertise on the rela- 
tionships between indicators and causes (23). In this case, 
an expert panel (DC, AH, SH, AK, KK) convened in 
Geneva in December 2011 and continued to deliberate 
subsequently, particularly considering issues that built 
on previous InterVA versions. Experience has shown that 
gradations in levels of perceived probabilities correspond 
more to a logarithmic than linear scale (24), and in the 
expert consultation for InterVA-4, we used a perceived 
probability scale that was subsequently converted to 
numbers on a logarithmic scale as shown in Table 1. This 
was similar to scales used previously but was extended 
into additional low probability categories. We have also 
demonstrated by means of sensitivity analyses within the 
probability matrix that although the conditional prob- 
ability values are important, the model does not require 
a high level of precision for these estimates (25). 

As was the case in InterVA-3 (5), there are special 
arrangements for two causes of death (HIV/AIDS and 
malaria), the occurrences of which vary appreciably in 
different locations. As described in the Inter VA-4 User 
Guide, the model has a facility for setting the uncondi- 
tional probabilities for each of these causes to reflect their 
occurrence in a particular population. Conceptually, this 
is analogous to a physician's background knowledge 
of local disease profiles, irrespective of the details of 
a particular case. In the new model, the setting for 
malaria is also applied to the unconditional probability 
for deaths due to sickle cell disease, because of the 
genetically determined geographical overlap between the 
two diseases (26). 

Having experienced difficulties in previous versions of 
the software with proprietary data file formats, which tend 
to change in unpredictable ways over time, InterVA-4 



Table 1. Qualitative probability scale used as the basis for 
eliciting expert opinion on probabilities 



Label 


Value 


Interpretation 


1 


1.0 


Always 


A+ 


0.8 


Almost always 


A 


0.5 


Common 


A- 


0.2 




B + 


0.1 


Often 


B 


0.05 




B- 


0.02 




C + 


0.01 


Unusual 


C 


0.005 




C- 


0.002 




D + 


0.001 


Rare 


D 


0.0005 




D- 


0.0001 




E 


0.00001 


Hardly ever 


N 


0 


Never 
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has reverted to text with comma-separated values (CSVs) 
for data input and output, as the lowest common 
denominator in file formats. CSV data can be relatively 
easily read and written by most analysis software pack- 
ages, and examples of Stata scripts for generating 
InterVA-4 input and processing output from InterVA-4 
in a standard way, deriving cause-specific mortality 
fractions (CSMFs), are included in the user guide. 

Having calculated a set of conditional probabilities 
P(Q|Ii . n) for causes Q . . . C m for each case, methods 
have to be applied for interpreting the output in ways that 
make sense from medical cause of death and public health 
perspectives. In the interests of transparency, InterVA-4 
automatically outputs the complete P(Q|Ii . . . n ) data grid 
for all causes and cases in a batch as a CSV file. However, 
this is not a very convenient format for routinely analysing 
cause of death data; hence, the InterVA-4 software also 
undertakes basic post-processing of the basic cause of 
death data into a more readily useable format. Although 
experience shows that the majority of cases arrive at a 
single overwhelmingly likely cause, some cases, particu- 
larly if there is scanty or somewhat contradictory infor- 
mation available, may end up with two or three causes 
of comparable likelihood. This is analogous to a physi- 
cian's differential diagnosis in which ambiguity remains 
between multiple likely causes. In a further minority of 
cases, there is no high likelihood cause, amounting to no 
clear conclusion on cause of death. To handle all this in a 
standardised way, InterVA-4 post-processes likely cause 
data in the same way that we have previously discussed in 
relation to InterVA-3 (5). If no single cause has a final 
likelihood of at least 0.4, the cause of death is considered 
to be indeterminate. The 0.4 cut-off is considerably higher 
than the unconditional probability of any cause, and con- 
ceptually includes a level approaching a 50:50 possibility 
for a particular cause, while leaving scope for other likely 
causes to be included in the overall consideration of the 
case. Multiple causes are only reported if they reach half 
of the likelihood of the leading cause. Any residual margin 
of likelihood not accounted for by the likelihood of the 
first and, where applicable, second and third causes can 
then be considered as a partial indeterminate component 
in analysing overall cause of death and CSMF patterns. 
We suggest that this is a much more appropriate approach 
than aggregating the sums of small residual probabilities 
of unlikely causes, which can lead to misleading results 
over large numbers of cases. 

Since an important part of the objectives behind this 
model is to extend the use of VA into routine death 
registration procedures, additional non-medical questions 
about circumstances of death have been included in the 
2012 WHO VA instrument to facilitate the interpreta- 
tion of cause of death in non-enumerated populations. 
Although these are not in any way intended to replace 
the developing role of social autopsy (27), future devel- 



opments in describing and modelling non-medical factors 
associated with cause of death are anticipated and will 
be reflected in future developments of InterVA. 

Comparing InterVA-3 and InterVA-4 

As with any software tool, one of our aims in developing 
this new version of InterVA was to improve the scope 
of the tool and address known shortcomings of the 
previous version. The differences between InterVA-3 and 
InterVA-4 are considerable, even though the same basic 
mathematical model lies at the heart of both versions. 
Both the range of causes of death and the scope of in- 
put indicators are considerably expanded in the new 
version and brought into line with the 2012 WHO VA 
instrument, but this makes a direct and meaningful 
comparison between the versions difficult. To demon- 
strate the similarities and differences, we have taken 
a random sample of 1,000 cases from the Agincourt, 
South Africa, dataset previously analysed (5). These cases 
have been run with the InterVA- 3 and Inter VA-4 models, 
but without adding data for the additional InterVA-4 
indicators. Age-specific CSMFs from both models for 
these same 1,000 cases are shown in Table 2. This de- 
monstrates the more detailed classification of neoplasms 
in the new model, as well as the detailed maternal causes 
brought in from InterVA-M. This illustrative dataset 
included 44/62 of the 2012 WHO VA instrument possible 
causes of death. 

Discussion 

The new InterVA-4 model presented here represents 
a substantial advance in automated cause of death 
modelling. It is specifically aligned to the new WHO 
2012 Verbal Autopsy instrument (21), although it can be 
used retrospectively for processing archived VA data, 
provided a reasonable spread of the necessary indicators 
are available. It offers the advantages of standardised 
interpretation over time and place; is freely available as a 
public-domain resource; and on a typical computer can 
process around 10,000 cases per hour. These attributes 
mean that it offers new opportunities for cause of death 
data to be gathered routinely in settings where hitherto 
there have been few or no such records and where 
physicians to assess VAs are few. The speed of processing 
can yield cause of death information with the potential 
to feed into local and national health policy development 
in a timely fashion. Standardised assessment means that 
trends in causes of death can be tracked over time and 
can be compared across different settings. We have 
previously shown that using a standard model across a 
wide range of settings yields appreciably different, and 
plausible, cause of death profiles (28). 

The potential disadvantage of using any mathematical 
model for interpreting cause of death is that some 
subtlety and nuance may be lost as compared to routine 
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Table 2. Comparison of age- and cause-specific mortality fractions (CSMF) using the Inter VA-3 and InterVA-4 models on 1,000 randomly selected VA records from a previous 
analysis of data in Agincourt HDSS, South Africa (12) 
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These demonstration data included 44/62 of the cause of death categories in the 2012 WHO VA instrument and lnterVA-4. 
CSMFs are rounded to nearest 1 %, 0 representing a finite value <0.5%. 
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physician certification of cause of death (either from 
attending physicians or from physicians coding individual 
VA cases). It is certainly true that mathematical models 
will fail to identify very unusual or complex causes of 
death. However, such cases tend to be of more local or 
esoteric interest rather than contributing significantly to 
the public health understanding of population mortality 
patterns. 

Considering the validity of any VA approach raises 
difficult issues. VA is normally used in settings where 
other sources of information on circumstances of death 
are lacking, precluding many direct comparisons. The VA 
interview stage carries inherent uncertainty as the quality 
of information obtained depends on a variety of factors 
including relationship of respondent to deceased, knowl- 
edge of signs and symptoms during terminal illness, recall 
and willingness to disclose information, especially for 
conditions that are stigmatised or have culturally sensitive 
connotations (29). The absolute validity of deaths certi- 
fied by attending physicians is by no means a given, and 
studies have shown considerable inconsistencies between 
hospital causes of death and pathologists' findings (30, 31). 
The validity of physician interpretation of VA material 
has also been shown to be questionable (3) and subject 
to inter-observer variation (12). It is also clear that any 
approach to VA data must involve some degree of 
capturing expert opinion, alongside the use of established 
information (32). Previous InterVA models have been 
extensively evaluated against local physician interpre- 
tations, with generally concordant findings (5, 7, 10, 12, 
14, 15). One evaluation suggested that InterVA-3 did not 
perform well against VA data from tertiary health facilities 
(33), but those data were not available for further study. 
Nevertheless, certain shortcomings have been identified in 
Inter VA-3, including over-diagnosis of meningitis, under- 
diagnosis and lack of differentiation between various 
cancers, and fairly high levels of indeterminate cases. 
InterVA-4 is designed to handle these more effectively, 
and we will continue to explore realistic opportunities for 
further validations. 

The comparison between InterVA-3 and InterVA-4 
demonstrated in Table 2 says nothing about the absolute 
validity of either model or about the mortality profile in 
the Agincourt population. It does demonstrate, however, 
the general continuity in results between the two versions 
of the model, when using exactly the same input data, 
as well as evidence of deliberately introduced changes 
in response to some of the shortcomings identified in 
Inter VA-3. This comparison does not necessarily make 
the most of some of the new aspects of Inter VA-4, since 
data for new indicators were not included in order 
to demonstrate continuity. We will continue to make 
InterVA-3 and InterVA-M available as legacy downloads, 
but we recommend the use of InterVA-4 wherever 
possible. 



InterVA-4 is launched here as a global resource. It 
offers substantial benefits over the default situation of 
not routinely capturing cause of death data (which is 
the de facto standard in many resource-limited parts 
of the world), considerable improvements over previous 
versions and the ability to more reliably compare causes 
of death across different settings and over time. This is 
important for research, for example, as INDEPTH cross- 
site research accelerates (20), and also offers new oppor- 
tunities for cause-specific civil registration of deaths on 
a much wider scale. Work is in progress on a portable 
electronic version of the model which will be particularly 
useful for routine registration. In the longer term, there 
will undoubtedly be further evaluations and develop- 
ments as the science and methods of attributing cause 
of death improve. We will continue to work with old and 
new collaborators on further refinements in this process. 
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